Contrastive analysis for scatterplot-based representations of dimensionality reduction

نویسندگان

چکیده

Cluster interpretation after dimensionality reduction (DR) is a ubiquitous part of exploring multidimensional datasets. DR results are frequently represented by scatterplots, where spatial proximity encodes similarity among data samples. In the literature, techniques support understanding scatterplots’ organization visualizing importance features for cluster definition with layout enrichment strategies. However, current approaches usually focus on global information, hampering analysis whenever to understand differences clusters. Thus, this paper introduces methodology visually explore and interpret clusters’ formation based contrastive analysis. We also introduce bipartite graph relationship between statistical variables employed how influence formation. Our approach demonstrated through case studies, in which we two document collections related news articles tweets about COVID-19 symptoms. Finally, evaluate our quantitative demonstrate its robustness

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dimensionality Reduction for Data in Multiple Feature Representations

In solving complex visual learning tasks, adopting multiple descriptors to more precisely characterize the data has been a feasible way for improving performance. These representations are typically high dimensional and assume diverse forms. Thus finding a way to transform them into a unified space of lower dimension generally facilitates the underlying tasks, such as object recognition or clus...

متن کامل

Projection Pursuit-Based Dimensionality Reduction for Hyperspectral Analysis

Dimensionality Reduction (DR) has found many applications in hyperspectral image processing. This book chapter investigates Projection Pursuit (PP)-based Dimensionality Reduction, (PP-DR) which includes both Principal Components Analysis (PCA) and Independent Component Analysis (ICA) as special cases. Three approaches are developed for PP-DR. One is to use a Projection Index (PI) to produce pro...

متن کامل

Random walk-based fuzzy linear discriminant analysis for dimensionality reduction

Dealing with high-dimensional data has always been a major problem with the research of pattern recognition and machine learning, and linear discriminant analysis (LDA) is one of the most popular methods for dimensionality reduction. However, it suffers from the problem of being too sensitive to outliers. Hence to solve this problem, fuzzy membership can be introduced to enhance the performance...

متن کامل

Neighborhood Components Analysis for Reward-Based Dimensionality Reduction

There has been a great deal of research that attempts to explain the structure of biological receptive fields in terms of various methods for adapting basis vectors based on the statistical structure of visual input. These include principal components analysis (Hancock et al., 1992), independent components analysis (Bell & Sejnowski, 1997), non-negative matrix factorization (Lee & Seung, 1999),...

متن کامل

‏‎a phonological contrastive analysis of kurdish and english‎‏

deposite the different criticisms on contrastive analysis it has been proved that the results of it(when processed)can be usuful in a tefl environment,specially at the level of phonology.this study is an attempt to compare and contrast the sound systems of kurdish and english for pedagogical aims. the consonants,vowels,stress and intonation of the twolanguages are described by the same model-ta...

15 صفحه اول

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computers & Graphics

سال: 2021

ISSN: ['0097-8493', '1873-7684']

DOI: https://doi.org/10.1016/j.cag.2021.08.014